Mixing patterns in networks 
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We study assortative mixing in networks, the tendency for vertices in networks to be connected to 
other vertices that are like (or unlike) them in some way. We consider mixing according to discrete 
characteristics such as language or race in social networks and scalar characteristics such as age. 
As a special example of the latter we consider mixing according to vertex degree, i.e., according to 
the number of connections vertices have to other vertices: do gregarious people tend to associate 
with other gregarious people? We propose a number of measures of assortative mixing appropriate 
to the various mixing types, and apply them to a variety of real-world networks, showing that 
assortative mixing is a pervasive phenomenon found in many networks. We also propose several 
models of assortatively mixed networks, both analytic ones based on generating function methods, 
and numerical ones based on Monte Carlo graph generation techniques. We use these models to 
probe the properties of networks as their level of assortativity is varied. In the particular case of 
mixing by degree, we find strong variation with assortativity in the connectivity of the network and 
in the resilience of the network to the removal of vertices. 



I. INTRODUCTION 

The techniques of statistical physics were developed to 
study the properties of systems of many interacting parti- 
cles, atoms, or molecules, but their applicability is wider 
than this, and recent work has fruitfully applied these 
techniques to economies, ecosystems, social interactions, 
the Internet, and many other systems of current interest. 
The component parts of these systems, the analogs of 
atoms and molecules, such things as traders in a market, 
or computers on the Internet, are not usually connected 
together on a regular lattice as the atoms of a crystal are. 
Nor indeed do their patterns of connection normally fit 
any simple low-dimensional structure. Instead they fall 
on some more generalized "network," which may be more 
or less random depending on the nature of the system. 
The broadening of the scope of statistical physics to cover 
these systems has therefore led us to the consideration of 
the structure and function of networks, as one of the fun- 
damental steps to understanding real-world phenomena 
of many kinds. Useful reviews of work in this area can 
be found in Refs. [J 

Recent studies of network structure have concentrated 
on a small number of properties that appear to be 
common to many networks and can be expected to af- 
fect the functioning of networked systems in a funda- 
mental way. Among these, perhaps the best studied 
are the "small- world effect" @, network transitiv- 
ity or "clustering" and degree distributions 0, Q- 
Many other properties however have been examined and 
may be equally important, at least in some systems. 
Examples include resilience to the deletion of network 
nodes 0, Wi [Tol [Tl[ JT3 , navigability or searchability of 
networks |l3t Il4 Il5l, community structure 0, 0, 0] , 
and spectral properties 0, |2(J, |2l|. In this paper 
we study another important network feature, the cor- 
relations between properties of adjacent network nodes 
known in the ecology and epidemiology literature as "as- 
sortative mixing." 



The very simplest representation of a network is a col- 
lection of points, usually called vertices or nodes, joined 
together in pairs by lines, usually called edges or links. 
More sophisticated network models may introduce other 
properties of the vertices or the edges. Edges for example 
may be directed — they point in one particular direction — 
or may have weights, lengths, or strengths. Vertices can 
also have weights or other numerical quantities associ- 
ated with them, or may be drawn from some discrete set 
of vertex types. In the study of social networks, the pat- 
terns of connections between people in a society, it has 
long been known that edges do not connect vertices re- 
gardless of their property or type. Patterns of friendship 
between individuals for example are strongly affected by 
the language, race, and age of the individuals in ques- 
tion, among other things. If people prefer to associate 
with others who are like them, we say that the network 
shows assortative mixing or assortative matching. If they 
prefer to associate with those who are different it shows 
disassortative mixing. Friendship is usually found to be 
assortative by most characteristics. 

Assortative mixing can have a profound effect on the 
structural properties of a network. For example, assor- 
tative mixing of a network by a discrete characteristic 
will tend to break the network up into separate commu- 
nities. If people prefer to be friends with others who 
speak their own language, for example, then one might 
expect countries with more than one language to sepa- 
rate into communities by language. Assortative mixing 
by age could cause stratification of societies along age 
lines. And while the main focus of this paper is on social 
networks, it is reasonable to suppose that similar mixing 
effects are seen in non-social networks also. We will give 
some examples of this in Section Hll Al 

In this paper we study assortative mixing of various 
types using empirical network data, analytic models, and 
numerical simulation. We demonstrate that assortative 
(or disassortative) mixing is indeed present in many net- 
works, show how it can be measured, and examine its 
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0.289 


0.204 
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TABLE I: The mixing matrix etj and the values of at and 
hi for sexual par tnerships in the study of Catania et al. [23|. 
After Morris gj. 



effect on network structure and behavior. The outline 
of the paper is as follows. In Section [D] we study the 
effects of assortative mixing by discrete characteristics 
such as language or race. In Section IlIII we study mixing 
by scalar properties such as age and particularly vertex 
degree; since degree is itself a property of the network 
topology, the latter type of mixing leads to some novel 
network structures not seen with other types. In Sec- 
tion II VI we give our conclusions. A preliminary report of 
some of the results in this paper has appeared previously 
asRef.E! 



II. DISCRETE CHARACTERISTICS 

In this section we consider assortative mixing accord- 
ing to discrete or enumerative vertex characteristics. 
Such mixing can be characterized by a quantity ey , which 
we define to be the fraction of edges in a network that 
connect a vertex of type i to one of type j. On an undi- 
rected network this quantity is symmetric in its indices 
e ij = e j*' wnue on directed networks or bipartite net- 
works it may be asymmetric. It satisfies the sum rules 



£< 



(i) 



where a,j and hi are the fraction of each type of end of an 
edge that is attached to vertices of type i. On undirected 
graphs, where the ends of edges are all of the same type, 
ai = bi 

For example, Table [I] shows data on the values of 
for mixing by race among sexual partners in a 1992 study 
carried out in the city of San Francisco, California [23j. 
This part of the study focused on heterosexuals, so this 
is a bipartite network, the two vertex types representing 
men and women, with edges running only between ver- 
tices of unlike types. This means that in this case the 
ends of an edge are different and the matrix is asym- 
metric. As the table shows, mixing is highly assortative 
in this network, with individuals strongly preferring part- 
ners from the same group as themselves. 



A. Measuring discrete assortative mixing 

To quantify the level of assortative mixing in a network 
we define an assortativity coefficient thus: 



J2i a i b i Tr e 



Ei a i b i 



(2) 



where e is the matrix whose elements are and || x || 
means the sum of all elements of the matrix x. This 
formula gives r — when there is no assortative mixing, 
since e^ — aibj in that case, and r = 1 when there is 
perfect assortative mixing and J2i e a = 1 • If the network 
is perfectly disassortative, i.e., every edge connects two 
vertices of different types, then r is negative and has the 
value 



E, 



(3) 



which lies in general in the range — 1 < r < 0. One 
might ask what this value signifies. Why do we not sim- 
ply have r = — 1 for a perfectly disassortative network? 
The answer is that a perfectly disassortative network is 
normally closer to a randomly mixed network than is a 
perfectly assortative network. When there are several dif- 
ferent vertex types (e.g., four in the case shown in Tabled 
then random mixing will most often pair unlike vertices, 
so that the network appears to be mostly disassortative. 
Therefore it is appropriate that the value r = for the 
random network should be closer to that for the perfectly 
disassortative network than for the perfectly assortative 
one. 

A quantity with properties similar to those of Eq. 
has been proposed previously by Gupta et al. 25] . How- 
ever the definition of Gupta et al. gives misleading results 
in certain situations, such as, for example, when one type 
of vertex is much less numerous than other types, as is the 
case m Table |T| In this paper therefore we use Eq. 
which doesn't suffer from this problem. The difference 
between the two measures is discussed in more detail in 
Appendix ^ 

Using the values from Table [I] in Eq. we find that 
r = 0.621 for the network of sexual partnerships, imply- 
ing, as we observed already, that this network is strongly 
assortative by race — individuals draw their partners from 
their own group far more often than one would expect on 
the basis of pure chance. 

As another example of the application of Eq. (0) , con- 
sider the network studied by Girvan and Newman 0] 
representing the timetable of American college football 
games, in which vertices represent universities and col- 
leges, and edges represent regular season games between 
teams during the year in question. Colleges are grouped 
into "conferences," which are defined primarily by geog- 
raphy, and teams normally play more often against other 
teams in their own conference than they do against teams 
from other conferences. In other words, there should be 
assortative mixing of colleges by conference in the sched- 
ule network. For the 2000 season schedule studied in 
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Ref.Gl we find a value for the assortativity coefficient 
of r = 0.586, again indicating strong assortative mixing, 
i.e., colleges do indeed play games with their conference 
partners to a substantially greater degree than one would 
expect in a randomly mixed network. 

It is also useful to know the expected statistical error 
on the value of r, so that we can evaluate the signifi- 
cance of our results. One way to calculate this error is 
to use the jackknife method |2g|. Regarding each of the 
M edges in a network as an independent measurement 
of the contributions to the elements of the matrix e, we 
can show that the expected standard deviation oy on the 
value of r satisfies 



(4) 



where rj is the value of r for the network in which the ith 
edge is removed. For example, in the case of the matrix of 
Tablefflthis gives ay = 0.014, which, when compared with 
the value r = 0.621 shows that our finding of assortative 
mixing is strongly statistically significant — a 40er result. 

Although it has a rather different physical interpreta- 
tion, the coefficient r is mathematically similar to the 
intraclass correlation coefficients used in statistics to 
compare measurements across different groups in a pop- 
ulation Standard results for errors on intraclass cor- 
relations can be adapted to the present case to show that 
another estimate of the error on r is [281 



are divided among the n possible vertex types accord- 
ing to some partition {r%,r2, ■ ■ -r n }, where J2j r j = 
The probability that the partition {rj} takes a particu- 
lar value is given by the multinomial distribution 



p«(fc,{^})=fc!n- 



J2j e u 



(6) 



Now, generalizing Ref. l34l we define a generating function 
for the distributions of the numbers of edges connecting 
to each type of vertex: 



OC 

k=0 { rj } j 

Performing the sum over {rj}, this gives 

Ej" e ij X j 



■x n ) = J^Pk 
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Ej e ij x j 



where 



Gg°(*)=z*i 



(7) 



(8) 



(9) 



2 _ 1 E t a ^ + [Li a M - a i b * ^ E t g<jg 
<Jr ~ m i-E^A ' (5) 

which gives a r = 0.012 for the data of Table [Q com- 
parable with the jackknife method. Either method for 
estimating oy will be adequate for most purposes — the 
choice between them is a matter of convenience. 



is the fundamental generating function for the degree dis- 

(i) I I 

tribution p k , as defined in Ref. I34L Similarly, for the 
edges connected to a vertex of type i reached by follow- 
ing a randomly chosen edge on the graph we have 



G\'{x 1 ,x 2l - ■ .x n ) = G\'\ 



E 



(10) 
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with 



B. Models of discrete assortative networks 

The generalized random graph models of networks 
studied in the past by various authors [2j| l30l l3lL l32l 
l33l I34L l35| can be extended to the case of assortative 
mixing on discrete characteristics. Suppose we are told 

(i) 

the degree distribution p k for vertices of type i = 1 . . . n 
in a network and the value of the mixing matrix e.y . Im- 
plicitly, we are also told the values of the quantities a* 
and bi, since we can extract them from using Eq. 
We consider the ensemble of all graphs with these val- 
ues of p k ^ and e jj , which gives us a random graph model 
similar in spirit to that of Refs. |2^, I30T l3ll l32l 13a . I34I l35l 
for the case of specified degree distribution only. Many 
properties of this ensemble can be calculated exactly in 
the limit of large system size, as we now demonstrate. 

Suppose that a vertex of type i has degree k. The k ver- 
tices at the other ends of the edges attached to this vertex 



G?{x) 



(<)™fc-i 



1 da 



(i) 



Erf 



da; 



(11) 



where Zi = G (1) is the mean degree for type i vertices. 

Now we consider the total number of vertices reachable 
by following an edge that arrives at a vertex of type i. 
This number has a distribution that is generated by a 

(i) 

generating function H\ satisfying a self-consistency con- 
dition of the form 

H i t\x)=xGf{H ( ?\x),...H ( r\x)). (12) 

And similarly the distribution of the number of vertices 
reachable from a randomly chosen vertex of type i is gen- 
erated by 

H^(x)=xG^(H[ 1 \x),...Hi n \x)). (13) 
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By solving these two equations simultaneously, we can 
derive the complete component size distribution in our 
model. Here however, we will just calculate some of the 
more important average statistics of our networks from 
the generating functions. For example, the mean number 
Si of vertices reachable from a vertex of type i is 



dH, 



dx 



= 1 + G%> (1) 



x=i t-ij e h 

We can write this in matrix form thus: 



m 



Hi(l), 



(14) 



(15) 



where mo is the matrix with elements Zidj/ai. Differen- 
tiating Eq. i|12|) . we then find that 



s = 1 + m • [I - mi] • 1, 



(16) 



where 1 is the vector whose elements are all 1, and the 
matrix mi has elements 



mi 



(») 

Z<2 &ij 
*W Chi 



(17) 



with z[ 1 ^ = zi being the mean degree of type i vertices 
and z\ being the mean number of neighbors at distance 
two from a type i vertex. 

When the density of edges on the graph is small, s in 
Eq. I|16|) is finite, but it grows with increasing density 
and then diverges when the determinant of the matrix 
I — mi reaches its first zero. This point marks the phase 
transition at which a giant component first appears in the 
network, similar to the phase transition seen in uncorre- 
lated random graphs. The condition det(I — mi) = for 
the position of the phase transition is the equivalent of 
the condition of Molloy and Reed [31] for the position of 
the phase transition in an uncorrelated random graph of 
arbitrary degree distribution. 

The size of the giant component can also be calculated 
in a straightforward manner. We define u.i to be the 
probability that a vertex of type i, reached by following 
a randomly chosen edge in the graph, does not belong 
to the giant component. This probability is precisely 
equal to the probability that none of the neighbors of that 
vertex are themselves members of the giant component, 
and hence Ui satisfies the self-consistency condition 



,u n ). 



(18) 



The probability that a randomly chosen vertex of type i 
is not a member of the giant component is then 
G ^ (u\ , . . . u n ) , and the overall fraction S of vertices that 
are in the giant component is given by 



(19) 



where we have made use of the fact that the fraction of 
vertices of type i in the network is equal to a-i/zi. 



The simultaneous solution of Eqs. (|18fl and (|19fl gives 
us the value of S for our network. In many cases we 
find that these equations are not solvable in closed form, 
but they can easily be solved numerically by iteration of 
Eq. (|18f) from suitable starting values for the Ui , and then 
substituting the result into l|19|) . 



C. Simulating discrete assortative networks 

We would also like to be able to generate random net- 
works with a given level of assortative mixing, in order to 
check our analytical results and also for use as substrates 
for other models, such as, for example, epidemiological 
models. A simple algorithm for achieving this is the fol- 
lowing. 

1. We first choose a size for our graph in terms of the 
number M of edges and we draw M edges from 
the distribution ey. That is, we generate M edges, 
each one identified by the types of the vertices that 
it connects, in some manner such that the fraction 
of edges connecting vertices of types i and j tends 
to eij as M becomes large. In practice, a simple 
transformation method works well |36j. 

2. We count the number of ends of edges of each 
type i, to give the sums rrii of the degrees of vertices 
in each class. We calculate the expected number rij 
of vertices of each type from rii = mi/ (rounded 
to the nearest integer) , where Zi is the desired mean 
degree of vertices of type i. 

3. We draw rij vertices from the desired degree dis- 

(i) 

tribution p k for type i. In general the degrees of 
these vertices will not sum exactly to rrii as we want 
them to. So we choose one vertex at random, dis- 

(i) 

card it, and draw another from the distribution p k 
until the sum does equal m,. 

4. We pair up the m, ends of edges of type i at ran- 
dom with the vertices we have generated, so that 
each vertex has the number of attached edges cor- 
responding to its chosen degree. 

5. We repeat from step 3 for each vertex type. 

This method correctly generates assortatively mixed 
graphs with the given e»j in the limit of large graph size. 
In Section IIII CI of this paper we give some examples of 
simulations of assortatively mixed networks. 



III. ASSORTATIVE MIXING BY SCALAR 
PROPERTIES 

A similar, but distinct, form of assortative mixing is 
mixing that depends on one or more scalar properties of 
network vertices. A classic example of mixing of this type 
seen in many social networks is assortative mixing by age. 
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FIG. 1: Top: scatter plot of the ages of 1141 married couples 
at time of marriage, from the 1995 US National Survey of 
Family Growth 1371 . Bottom: a histogram of the age differ- 
ences (male minus female) for the same data. 



As before, we can use the matrix e xy to define a mea- 
sure of assortativity. We first note that e xy satisfies the 
sum rules 



xy 



1. 



(20) 



where a x and b y are, respectively, the fraction of edges 
that start and end at vertices with values x and y. (On an 
undirected, unipartite graph, a x — b x .) Then, if there is 
no assortative mixing e xy = a x b y . If there is assortative 
mixing one can measure it by calculating the standard 
Pearson correlation coefficient thus: 



xy(e xy - a x b v ) 



(21) 



where a a and Ob are the standard deviations of the dis- 
tributions a x and b y . The value of r lies in the range 
— 1 < r < 1, with r — 1 indicating perfect assortativity 
and r — —1 indicating perfect disassortativity (i.e., per- 
fect negative correlation between x and y). For the age 
data from Fig. ^ f° r example, we find that r — 0.574, 
indicating strong assortative mixing once more. 

One can construct in a straightforward manner a ran- 
dom graph model of a network with this type of mixing 
exactly analogous to the model presented in Section lll Bl 
It is also possible to generate random representative net- 
works from the ensemble denned by e xy using the algo- 
rithm described in Section III CI In this paper however, 
rather than working further on the general type of mixing 
described here, we will concentrate on one special exam- 
ple of assortative mixing by a scalar property which is 
particularly important for many of the networks we are 
interested in, namely mixing by vertex degree. 



A. Mixing by vertex degree 



In Fig. ^ (top panel) we show a scatter plot of the ages 
of marriage partners in the 1995 US National Survey of 
Family Growth [37j . As is clear from the figure, there is 
a strong positive correlation between the ages, with most 
of the density in the distribution lying along a rough 
diagonal in the plot; people, it appears, prefer to marry 
others of about the same age, although there is some 
bias towards husbands being older than their wives. In 
the bottom panel of the same figure we show a histogram 
of the age differences in the study, which emphasizes the 
same conclusion [Trij . 

By analogy with the developments of Section^ we can 
define a quantity e xy , which is the fraction of all edges 
in the network that join together vertices with values x 
and y for the age or other scalar variable of interest. The 
values x and y might be either discrete in nature (e.g., in- 
tegers, such as age to the nearest year) or continuous (ex- 
act age) , making e xy either a matrix or a function of two 
continuous variables. Here, for simplicity, we concentrate 
on the discrete case, but generalization to the continuous 
case is straightforward. 



In general, scalar assortative mixing of the type de- 
scribed above requires that the vertices of the network of 
interest have suitable scalar properties attached to them, 
such as age or income in social networks. In many cases, 
however, data are not available for these properties to 
allow us to assess whether the network is assortatively 
mixed. But there is one scalar vertex property that is 
always available for every network, and that is vertex 
degree. So long as we know the network structure we 
always know the degree of a vertex, and then we can 
ask whether vertices of high degree preferentially asso- 
ciate with other vertices of high degree. Do the gregari- 
ous people hang out with other gregarious people? This 
has been a topic of considerable discussion in the physics 
literature Ha E3, E3, As we will show, many 

real-world networks do show significant assortative (or 
disassortative) mixing by vertex degree. 

Assortative mixing by degree can be quantified in ex- 
actly the same way as for other scalar properties of ver- 
tices, using Eq. (|21|) . Taking the example of an undi- 
rected network and using the notation of Ref. I22T we 
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define ejk to be the fraction of edges that connect ver- 
tices of degrees j and k. In fact, we choose j and k to be 
the excess degrees of the vertices (also called remaining 
degree in Ref . 122^) . which are one less than the degrees 
of the vertices themselves. This is because in most cases 
we are interested in the number of edges attached to a 
vertex other than the particular edge we are looking at 
at the moment. 

If the degree distribution of the graph as a whole is pk, 
i.e., pk is the probability that a randomly chosen vertex 
will have degree k, then the excess degree of the vertex 
at the end of an edge is distributed according to [34| 



(k + l)p k +i 



(22) 



where z — £ fc kpk is the mean degree in the network. 
The distribution qk is related to ejk via 



J2 e J k = qk - 



(23) 



The correct assortativity coefficient for mixing by vertex 
degree in an undirected network is 



Y.ik3k{Z]k-q 3 qk) 



(24) 



where <j q is the standard deviation of the distribution qk- 
For a directed network the equivalent expression is 



(25) 



where ejk is now the probability that a randomly chosen 
directed edge leads into a vertex of in-degree j and out 
of a vertex of out-degree k [7^ . For the purposes of 
calculating r for an actual network of specified vertices 
and edges, we can rewrite this in the form 

V .,,/••, M : V. /( V./,v 



(26) 

where ji and fe« are the excess in-degree and out-degree 
of the vertices that the zth edge leads into and out of 
respectively, and M is again the number of edges. For 
an undirected network we can use the same formula — we 
simply replace each undirected edge by two directed ones 
leading in opposite directions. Alternatively, one can ap- 
ply directly the formula given in Ref. |22j, Eq. (4) , to the 
undirected network. As before, errors on the measured 
values of r can be calculated using the jackknife method 
and Eq. gj. 

In Table [n] we show the measured values of r for de- 
gree correlations in undirected and directed networks of 
a variety of different types, along with the expected er- 
rors on these values. The table reveals an interesting 
feature: essentially all the social networks examined are 



significantly assortative by degree, i.e., high degree ver- 
tices tend to be connected to other high degree vertices, 
while all the technological and biological networks are 
disassortative. Three of the values for r, for the network 
of student relationships, the power grid, and the graph 
of software dependencies, are null results, meaning that 
they are not statistically different from zero. All the oth- 
ers however fit the pattern clearly, with positive values 
of r for the social networks and negative values for all 
the others. 

What is the explanation of this phenomenon? In all 
probability, there are a number of different mechanisms 
at work. Some possibilities are the following: 

1. In the social networks it is entirely possible, and 
is often assumed in the sociological literature, that 
similar people do attract one another, and there- 
fore that there could be a real preference among 
gregarious people for association with other gregar- 
ious people, and similarly for hermits. 

2. On the other hand, the networks of collaborations 
between academics, actors, and businesspeople con- 
sidered here are affiliation networks, i.e., networks 
in which people are connected together by member- 
ship of common groups (authors of a paper, actors 
in a film, etc.). Since all members of a group are 
thus connected to all other members, the positive 
correlations between degrees may at least in part 
reflect the fact that the members of a large (small) 
group are connected to the other members of the 
same large (small) group. 

3. In the Internet and the World-Wide Web there 
may be organizational reasons for degree anti- 
correlation between vertices. The high-degree 
vertices in these networks are often connectivity 
providers (Internet) or directories (Web), which 
by definition tend to be connected to the "little 
people" — the individual service subscribers in the 
case of the Internet or the individual web-pages on 
the Web. 

4. Maslov and Sneppen [3!| have shown that disas- 
sortativity can be produced as a finite-size effect by 
the constraint that no two vertices in a network are 
connected by more than one edge. This constraint 
causes high-degree vertices to "repel" one another, 
producing negative values of r. This explanation 
could account for at least a part of the disassorta- 
tive mixing we see in the Internet, the protein and 
metabolic networks, and the food webs, although it 
cannot be applied directly to the Web and neural 
networks, for which vertex pairs can and often do 
have more than one connection. 

It appears therefore that some of the degree correlations 
we see in our networks could have real social or organiza- 
tional origins, while others may be artifacts of the types 
of networks we are looking at and the constraints that 
are placed on their structure. 
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social 



technological 



biological 



network 


type 


size n 


assort at ivity r 


error a 


r 


ref. 


physics coauthorship 


undirected 


52 909 


0.363 


0.002 




a 


biology coauthorship 


undirected 


1 520 251 


0.127 


0.0004 




a 


mathematics coauthorship 


undirected 


253 339 


0.120 


0.002 




b 


film actor collaborations 


undirected 


449 913 


0.208 


0.0002 




c 


company directors 


undirected 


7673 


0.276 


0.004 




d 


student relationships 


undirected 


573 


-0.029 


0.037 




e 


email address books 


directed 


16 881 


0.092 


0.004 




f 


power grid 


undirected 


4 941 


-0.003 


0.013 




g 


Internet 


undirected 


10 697 


-0.189 


0.002 




h 


World-Wide Web 


directed 


269 504 


-0.067 


0.0002 




i 


software dependencies 


directed 


3162 


-0.016 


0.020 




j 


protein interactions 


undirected 


O 1 ir 


— O.lOD 


0.U1U 




k 


metabolic network 


undirected 


765 


-0.240 


0.007 




1 


neural network 


directed 


307 


-0.226 


0.016 




m 


marine food web 


directed 


134 


-0.263 


0.037 




n 


freshwater food web 


directed 


92 


-0.326 


0.031 




o 



TABLE II: Size n, degree assortativity coefficient r, and expected error a r on the assortativity, for a number of social, 
technological, and biological networks, both directed and undirected. Social networks: coauthorship networks of (a) physicists 
and biologists |43| and (b) mathematicians |44|. in which authors are connected if they have coauthored one or more articles 
in learned journals; (c) collaborations of film actors in which actors are connected if they have appeared together in one or 
more movies H0; (d) directors of Fortune 1000 companies for 1999, in which two directors are connected if they sit on the 
board of directors of the same company 14511 ; (e) romantic (not necessarily sexual) relationships between students at a US high 
school 46]; (f) network of email address books of computer users on a large computer system, in which an edge from user A 
to user B indicates that B appears in A's address book |47| . Technological networks: (g) network of high voltage transmission 
lines in the Western States Power Grid of the United States [f|; (h) network of direct peering relationships between autonomous 
systems on the Internet, April 2001 48]; (i) network of hyperlinks between pages in the World-Wide Web domain nd.edu, circa 
1999 49]; (j) network of dependencies between software packages in the GNU/Liimx operating system, in which an edge from 
package A to package B indicates that A relies on components of B for its operation. Biological networks: (k) protein-protein 
interaction network in the yeast S. Cerevisiae [50|: (1) metabolic network of the bacterium E. Coli |5ll| : (m) neural network of 
the nematode worm C. Elcgans [EL l52l|_tropic interactions between species in the food webs of (n) Ythan Estuary, Scotland |53| 
and (o) Little Rock Lake, Wisconsin |54|l. 



B. Models of assortative mixing by degree 

In Ref. |22j we studied the ensemble of graphs that have 
a specified value of the matrix ejk and solved exactly for 
its average properties using generating function methods 
similar to those of Section lll Bl We showed that the phase 
transition at which a giant component first appears in 
such networks occurs at a point given by det(I — m) = 0, 
where m is the matrix with elements rrijk = kejk / qj ■ One 
can also calculate exactly the size of the giant component, 
and the distribution of sizes of the small components be- 
low the phase transition. While these developments are 
mathematically elegant, however, their usefulness is lim- 
ited by the fact that the generating functions involved 
are rarely calculable in closed form for arbitrary speci- 
fied ejk, and the determinant of the matrix I — m almost 
never is. In this paper, therefore, we take an alternative 
approach, making use of computer simulation. 

We would like to generate on a computer a random 
network having, for instance, a particular value of the 
matrix ejk- (This also fixes the degree distribution, via 
Eq. (ESI.) In Ref. ^ we discussed one possible way of 
doing this using an algorithm similar that of Section lll CI 
One would draw edges from the desired distribution e jfc 
and then join the degree k ends randomly in groups of k 
to create the network. (This algorithm has also been 



discussed recently by Dorogovtsev et al. |4(j.) As we 
pointed out, however, this algorithm is flawed because 
in order to create a network without any dangling edges 
the number of degree k ends must be a multiple of k for 
all k. It is very unlikely that these constraints will be 
satisfied by chance, and there does not appear to be any 
simple way of arranging for them to be satisfied without 
introducing bias into the ensemble of graphs. Instead, 
therefore, we use a Monte Carlo sampling scheme which is 
essentially equivalent to the Metropolis-Hastings method 
widely used in the mathematical and social sciences for 
generating model networks [5^, |5(| . The algorithm is as 
follows. 

1. Given the desired edge distribution ejk, we first 
calculate the corresponding distribution of excess 
degrees qk from Eq. and then invert Eq. l(2^|) 
to find the degree distribution: 



Pk 



qk-i/k 



(27) 



Note that this equation cannot tell us how many 
vertices there are of degree zero in the network. 
This information is not contained in the edge dis- 
tribution ejk since no edges connect to degree-zero 
vertices, and so must be specified separately. On 
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the other hand, most of the properties of networks 
with which we will be concerned here don't depend 
on the number of degree-zero vertices, so we can 
safely set po — for the purposes of this paper. 

2. We draw a degree sequence, a specific set ki of de- 
grees of the vertices i — 1 . . . N, from the distribu- 
tion pfc, and connect vertices together randomly in 
pairs to generate a random graph, as described, for 
instance, by Molloy and Reed |31| . 

3. We choose two edges at random from the graph. 
Let us denote these by the vertex pairs (v± , w± ) and 
(i>2,W2) that they connect. 

4. We measure the excess degrees j% , k± , j 2 , k 2 of the 
vertices Vi,Wi,V2, Wi and then we remove the two 
edges and replace them with two new ones [v\, v 2 ) 
and (wi , w 2 ) with probability 



3 &k\k2 



P = 



e jlki e j 2 k 2 
1 



^ e jlj2 e kik 2 < e jik 1 Gj 2 k 2 

otherwise. 



(28) 



5. Repeat from step 3. 

Clearly this swap procedure preserves the degree se- 
quence. It is also ergodic over the set of graphs with 
that degree sequence, i.e., it can reach any graph within 
that set in a finite number of moves. To see this, consider 
any configuration of the graph other than the desired tar- 
get configuration and choose any vertex pair that is not 
joined by an edge in that configuration but is joined by 
an edge in the target. These vertices must necessarily 
each be attached to at least one other edge that does 
not exist in the target configuration. Take these edges 
and perform the swap procedure on them. This always 
increases the number of edges that the configuration has 
in common with the target. And since it is always pos- 
sible to do this, it immediately follows that any target 
configuration can be reached in at most M such moves, 
where M is the number of edges in the network. (Ac- 
tually, M — 1 will suffice, since the last edge will always 
automatically be in the correct position by a process of 
elimination.) 

Our algorithm also satisfies detailed balance. We 
would like to sample graph configurations /i with proba- 
bilities FI 



M 

n 



■jiki 



(29) 



where ji , ki are the excess degrees of the vertices at the 
ends of the ith edge. It is trivial to show that with the 
choice of transition probabilities given in Eq. (|28|l . we 
satisfy detailed balance in the form 



Pn p (v -+v)= p v P{v -> fi), 
for all pairs fi, v of states. 



(30) 



Since our algorithm satisfies both ergodicity and de- 
tailed balance, it immediately follows |3y| that in the 
limit of long time it samples graph configurations cor- 
rectly from the distribution (|29|) . It also turns out to 
be a reasonably efficient algorithm in practice. In the 
simulations reported here, the mean fraction of proposed 
Monte Carlo moves that was accepted never fell below 
50% for any parameter values. 

To use this algorithm we also need to choose a value for 
the matrix ejk- We have a lot of freedom about how we 
do this. Suppose for example that we wish to simulate an 
undirected network, so that ejk is symmetric. A rank-n 
symmetric matrix has hn{n + I) degrees of freedom, n of 
which are fixed in this case by the requirement, Eq. 12311 . 
that the rows and columns sum to qk, leaving \n(n — 1) 
that can be freely chosen. (One could think for example 
of choosing the \n{ n — 1) off-diagonal elements of ejk and 
then satisfying l|23l) by choosing the n diagonal elements 
to give the row and column sums the desired values.) 

A simple example of a disassortative choice for ejk is 
the set of matrices taking the form 



„(<Q _ i „ _ 

°'jk Qj^k > ZjQk •KjVCki 



(31) 



where Xk is any distribution normalized such that 
J2k Xk = 1- It i s eas Y to see that this choice satisfies 
the constraints on ejk , and the resulting value of r is 



Td = 



(32) 



which is always negative (or zero). Here fi q and fi x are 
the means of the distributions qk and Xk respectively. 

Being a probability, ejk is also constrained to lie in the 
range < ejk < 1 for all j, k. To ensure that Eq. lj3~T|) 
never becomes negative we should choose Xk to decay 
faster than qk- 

Suppose for example that we are interested, as many 
people seem to be these days, in networks that have 
power-law degree distributions, pk ~ k~ T 0, 0, 0J 0, HE 
51 EHH Hi HJ I13 True power laws unfortunately 
are troublesome to deal with; the crucial distribution qk 
has a divergent mean unless r > 3, which it seldom is for 
real-world networks (see, for instance, Ref. 2, Table II). 
Instead, therefore, following Ref. 02 we nere examine the 
exponentially truncated power-law distribution 



-k/ k 



Pk 



for k > 1, 



(33) 



where the function Li n (x), which acts here as a normaliz- 
ing constant, is the nth polylogarithm of x. This gives a 
similar distribution qk ~ (k+ I)~ T+1 e - ( fe+1 '/ K for the ex- 
cess degree, and we choose Xk to have the same functional 
form, but with a different cutoff parameter k' , where 
k' < k, to ensure that ejk > 0. In Fig. [21 we show a 
plot of the resulting ejk for r = 2.5, K — 100, k' = 10. 
The disassortative nature of this choice for ejk is evident 
from the concentration of probability along the edges of 
the matrix in the figure. 
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k 




10 20 30 



FIG. 2: Density plot of the matrix e { f k > , Eq. l|3T). for r = 2.5, 
ft = 100, k' = 10. Darker squares represent higher values 
of e jk - 

Introducing a cutoff in the degree distribution also pro- 
vides us with a parameter, namely /c, that can, as we 
will shortly see, be conveniently manipulated to produce 
a phase transition at which a giant component appears 
in the network. 

We can also make an assortative matrix ejk by writing 

e ^=2 qj q k -ef k \ (34) 

which gives a value for the assortativity coefficient of 
r a = —rd, while still having the same degree distribu- 
tion. More generally, we would like to be able to vary r 
freely, keeping the degree distribution fixed. We can do 
this by writing ejk in the form 

ejk = qjQk + ra^m lk , (35) 

where the symmetric matrix rrijk has all row and column 
sums zero and is normalized such that 

^jkm^ = 1. (36) 

For any choice of rrijk satisfying these constraints, 
Eq. (|35ll gives us a one-parameter family of networks 
parametrized by the assortativity coefficient r. We can 
for example choose 

rrijk ~ QjQk ~ ejfc = (ij - Xj)(q k -x k ), (37) 

for any correctly normalized x k - Then Eq. (|35|l allows 
us to interpolate smoothly between e$ and e^} (and 
beyond) by simply varying the value of r. Note that 
whenever r = 0, we get a simple random graph without 
degree correlations, of the type discussed by Molloy and 
Reed [3lJ and others [7jJ . 




cutoff parameter K 



FIG. 3: Monte Carlo simulation results for the size of the 
giant component in graphs with the degree distribution 133ft . 
as a function of the cutoff parameter k, with r = 2.5 and 
«' = 2. The individual curves are for different values of the 
assortativity, as marked. Each data point is an average over 
100 graphs of 100 000 vertices. 

C. Simulation results 

For ejk chosen according to Eqs. (|35|l and l|37|l . with 
qk and Xk taking the same truncated power-law form as 
in Fig. |21 we have performed simulations for a variety of 
values of the parameters r, k, and k' . 

In Fig. we show the size of the largest component in 
the graph as a function of k, for three different values of 
the assortativity coefficient r. As the cutoff parameter k 
increases, the mean degree of the graph increases also, so 
that the graph becomes more dense, ultimately passing 
the critical point at which a giant component develops. 
The figure reveals two findings of particular note. 

1. The position of the phase transition at which the 
giant component appears moves to higher values 
of k as the value of r decreases. That is, the more 
assortative a network is, the lower the edge density 
at which a giant component first appears. This is 
intuitively reasonable. In assortatively mixed net- 
works, the high-degree vertices tend to associate 
preferentially with one another, sticking together 
and forming what in the epidemiological literature 
is called a "core group." Within this core group the 
edge density is higher than it is in the graph as a 
whole, since the vertices in the group have higher- 
than-average degree. Thus one would expect to see 
a giant-component forming in the core group before 
it would form in a graph of the same average den- 
sity but with no assortative mixing. Conversely, in 
graphs that are disassortatively mixed, the phase 
transition happens at a higher density than in neu- 
trally mixed graphs. 

2. The size of the giant component in the limit of 
large k is smaller for the assortatively mixed graph 



10 




fraction of vertices removed 

FIG. 4: The size of the largest component in a network as a 
fraction of system size as the highest-degree vertices are re- 
moved one by one. Each curve is for a single network of 10 7 
vertices generated using the Monte Carlo method described in 
the text, with the degree distribution and the matrix ejk cho- 
sen according to Eqs. (Hit and 13311 . with r = 2.5, k = 500, 
ft' = 5, and r-values as marked. Clearly the most assorta- 
tively mixed network (r = 0.2) is considerably more robust 
against the removal of vertices than the most disassortative 
(r = -0.2). 

than for the neutral and disassortative ones. While 
this might seem at first to be at odds with the re- 
sult that assortative graphs show a phase transition 
at lower density, it is really a reflection of the same 
underlying mechanism. Although the presence of a 
core group in an assortative graph allows it to per- 
colate at a lower average density than other graphs, 
it also means that the density in other parts of the 
graph, outside the core group, is lower, and hence 
that the giant component is unlikely to extend into 
those regions. Thus the giant component is con- 
fined to the core of the network, and cannot grow 
as large as in a neutral or disassortative network. 

A question of considerable interest in the study of net- 
worked systems is that of network resilience to the dele- 
tion of vertices. Suppose vertices are removed one by 
one from a network. How many must be removed be- 
fore the giant component of the network is destroyed and 
network communication between distant vertices can no 
longer take place? Many networks, particularly those 
with highly skewed degree distributions, are found to be 
resilient to the random deletion of vertices but suscepti- 
ble to the targeted deletion specifically of those vertices 
that have the highest degrees M, HH Li3 ■ As we now 
show, these general results are modified by the presence 
of assortative mixing in the network. 

In Fig. we show the size of the largest component 
for five networks with different values of r as vertices are 
removed in decreasing order of their degree — i.e., highest 
degree vertices first 80]. As the figure shows, although 
each of the networks has the same degree distribution, 
there is dramatic variation in the resilience of the net- 



works with their assortativity. For the most assortative 
network, with r = 0.2, it requires the removal of about 
ten times as many high-degree vertices to destroy the 
giant component as for the most disassortative one, with 
r = —0.2, even though the disassortative network starts 
out with a giant component about twice as big. 

This finding fits naturally with our picture of an assor- 
tative network as dominated by a core group of intercon- 
nected high-degree vertices. Such a core group provides 
robustness to the network by concentrating all the obvi- 
ous target vertices together in one portion of the graph. 
Removing these high-degree vertices is still one of the 
most effective ways to destroy network connectivity, but 
it is less effective because by removing them all from the 
same portion of the graph we fail to attack other portions. 
And if those other portions are themselves percolating, 
then a giant component will persist even as the highest 
degree vertices vanish. 

Conversely, the disassortatively mixed network is par- 
ticularly susceptible to removal of high-degree vertices 
because those vertices are strewn far apart across the 
network, so that attacking them attacks all parts of the 
network at once. 

One can also ask about the resilience of networks un- 
der random failure of their vertices (rather than targeted 
attack) HElIlJ- Although we do not treat this case 
in detail here, it is reasonable to suppose that it is sim- 
ilar to the case of targeted attack. If assortative mixing 
makes networks more resilient against removal of their 
highest-degree vertices, then presumably they will also be 
resilient against removal of random ones; random vertex 
failure will do most damage when it happens to hit high- 
degree vertices, but, as we have seen, this vulnerability is 
diminished by the concentration of the high-degree ver- 
tices in the core group. Some qualitative behaviors of the 
system may be unaffected by assortativity, however. For 
example, it is known that the fraction of vertices that 
must randomly fail to destroy the giant component in 
a network with a power-law degree distribution and un- 
corrected degrees tends to unity as graph size becomes 
large, provided the exponent of the power law satisfies 
r < 3 10]. Vazquez and Moreno |6^] have recently shown 
that this result is not affected by the presence of assor- 
tative mixing by degree in the network, although disas- 
sortative mixing can make a difference. 



D. Discussion 

The results found here could have applications in a 
variety of areas. Consider for example, the spread of dis- 
eases on networks, which has been the subject of much 
attention in the recent networks literature [&4l l65l Ifjrl 
El IH El E3, E3- The largest component of the con- 
tact network over which a disease spreads represents the 
largest possible disease outbreak on that network, and a 
network with no giant component cannot show an epi- 
demic (system-wide) outbreak. Thus our finding that a 
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giant component forms more easily in a network that is 
assortatively mixed by degree suggests that in such net- 
works epidemic outbreaks would become possible at a 
lower edge density than in the corresponding disassorta- 
tive network. In the language of epidemiology, the core 
group of an assortatively mixed network forms a "reser- 
voir," which can sustain an outbreak of the disease even 
when the density of the network as a whole is too low to 
do so. On the other hand, the smaller asymptotic size of 
the giant component in an assortatively mixed network 
seems to imply that, when they occur, epidemics in such 
networks would be restricted to a smaller segment of the 
population than in a similar disassortative network — the 
outbreak is confined mostly to the core group and does 
not spread to the population as a whole. Thus from the 
epidemiological point of view there are both good and 
bad sides to the phenomenon of assortativity. 

One could test these predictions explicitly by study- 
ing epidemic models such as SIR or SIRS models [73, UM 
on assortatively mixed model networks of the type intro- 
duced here. Some studies of this kind have already been 
carried out — see for example Refs. an( A — although 
the particular conjectures put forward here have not been 
conclusively verified. 

Our findings on network resilience also have some prac- 
tical implications. In the context of epidemiology, for in- 
stance, removal of vertices from the network might cor- 
respond to immunization of individuals to prevent the 
spread of disease. Assuming that the goal of a vaccina- 
tion program is to destroy network connectivity so that 
the disease in question cannot spread, our findings sug- 
gest that even targeted vaccination strategies would be 
less effective in assortative networks than in disassorta- 
tive or neutral ones because of the resilience of the net- 
work to this type of attack. In other contexts, however, 
resilience is a good thing. For example we would like 
to make networks such as the Internet and other com- 
munication or distribution networks resilient against at- 
tacks on their vertices. In this context assortative mixing 
would be beneficial. 

Unfortunately, when we look at Tabled we find a dis- 
couraging picture. As we pointed out in Section IIII Al 
almost all the social networks we have looked at are sig- 
nificantly assortative, meaning that they would be robust 
to vertex removal. But these are the very networks by 
which disease spreads, the ones that we would like to 
be able to attack using vaccination strategies. Even the 
email network, which is relevant to the spread of com- 
puter viruses [47]], is assortative and hence resilient. On 
the other hand, the technological networks like the In- 
ternet, which we would like to be able to protect, are 
disassortative, and hence particularly vulnerable to tar- 
geted attack. 



IV. CONCLUSIONS 

In this paper we have studied the phenomenon of as- 
sortative mixing in networks, which is the tendency for 
vertices in networks to connect preferentially to other 
vertices that are like them in some way. This preference 
may take a number of forms. Mixing may follow discrete 
or enumerative characteristics. In the social networks 
that have been the main focus of this paper, connections 
between people may be assortative by language, for ex- 
ample, or by race — people may prefer to associate with 
others who speak the same language as they do or are of 
the same race. Mixing can also be dictated by scalar char- 
acteristics such as age or income. A special case of mixing 
by a scalar characteristic is mixing according to vertex 
degree, which has been shown previously to be present 
in a variety of networks, including non-social ones such 
as the Internet and protein interaction networks. Mixing 
can also be disassortative, meaning that vertices in the 
network preferentially form connections to others that 
are unlike them. 

We have proposed some simple measures for these 
types of mixing, which we call assortativity coefficients. 
These measures are positive or negative for assortative or 
disassortative mixing respectively, and zero for neutrally 
mixed networks. Applying our measures to a broad se- 
lection of network data drawn from various real-world 
situations we have shown that the phenomenon of assor- 
tative mixing is indeed widespread, with only a few of 
the networks studied showing no statistically significant 
biases in their mixing patterns. In the case of mixing by 
vertex degree, a remarkable pattern emerges. Almost all 
the social networks studied show positive assortativity 
coefficients while all other types of networks, including 
technological and biological networks, show negative co- 
efficients, i.e., disassortative mixing. Only three networks 
that showed no significant trend either way failed to fol- 
low this rule. We have offered some conjectures about 
the origin of this striking regularity, but we believe it un- 
likely that any single mechanism can explain the mixing 
patterns of all of these disparate networks. 

We have also proposed a number of models of assorta- 
tively mixed networks, for mixing both by discrete and by 
scalar characteristics. For each of the mixing types con- 
sidered it is possible to create random graph models for 
which one can calculate exactly by generating function 
methods certain average properties of network ensem- 
bles. We have also described Monte Carlo methods for 
generating random graphs drawn from each of the classes 
discussed with specified values of the mixing parameters. 

For the case of mixing by vertex degree we have per- 
formed extensive simulations. Two results of particular 
interest emerge from these studies. First, we find that 
networks that are assortatively mixed by degree percolate 
more easily that their disassortative counterparts. That 
is, a giant component of connected vertices forms in the 
network at lower edge density than in another network 
with the same degree distribution but zero or negative 
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assortativity. This result may imply, for instance, that 
assortatively mixed social networks would support epi- 
demic disease outbreaks more easily than disassortatively 
mixed ones, which would be a disheartening conclusion, 
given our finding that most social networks appear to be 
assortative. 

Second, we find that assortatively mixed networks are 
more robust to the deletion of their vertices than disassor- 
tatively mixed or neutral networks. We have studied in 
particular the case of the targeted deletion of the highest- 
degree vertices, which has been suggested as a possible 
vaccination strategy for breaking up networks of disease- 
causing contacts, but it is reasonable to suppose that the 
same result will extend also to the random deletion of 
vertices. This result too leads to a rather gloomy conclu- 
sion: targeted vaccination strategies may be less effective 
than we would hope in preventing disease because of the 
assortative and hence resilient nature of social networks, 
while on the other hand networks that we would hope 
to protect against vertex removal, communication net- 
works like the Internet, for instance, will be particularly 
susceptible because of their disassortative nature. 



where as before n is the number of vertex types, and 
we have made use of J2i bi = 1- Like our measure, this 
measure is for a neutrally mixed network and 1 for a 
perfectly assortative network. In general, however, the 
values of the two measures are quite different. Here we 
give a simple example to illustrate the difference between 
the two. 

Consider a network with three types of vertices. There 
are 100 vertices of type 1, 100 of type 2, and 2 of type 3. 
The vertices of types 1 and 2 mix indiscriminately with 
one other — connections from 1 to 2 are as likely as from 
1 to 1, and so forth. The vertices of type 3 however 
associate only among themselves and not with types 1 
and 2 at all. This is reflected in the matrix e for the 202 
vertices, which for a network with mean degree 2 would 
look like this: 



, /50 50 0\ 

e = 50 50 . 

202 ^0 2) 
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APPENDIX A: THE ASSORTATIVITY 
MEASURE OF GUPTA ET AL. 

Gupta et al. |2j| have defined a measure of assortative 
mixing by discrete types different from the one that we 
have introduced in Section III Al In our notation their 
measure is 

q = E» grHgjj - a * b i) = E t en/m - 1 (Al) 
n — 1 n — 1 



Clearly most of this network — 99% of it, in fact — is mix- 
ing randomly, and hence we would expect the assortativ- 
ity coefficient to be close to zero. The value of r for the 
matrix above reflects this; we find r = 0.029. The mea- 
sure of Gupta et al. [25J, however, takes a value Q = 0.50. 
This appears to indicate that the network has very strong 
assortative mixing, when in fact it does not. The reason 
for this is that the measure of Gupta et al., rather than 
giving each vertex in the network equal weight, weights 
each type of vertex equally, so that vertices that be- 
long to large groups get less weight in the calculation 
than those in small groups. In the present case, where 
one group is very small, the vertices in that group are 
weighted very heavily, and since those vertices mix per- 
fectly assortatively, the value of Q is, as a result, high. 
If we remove these vertices from the network, the value 
of Gupta et aZ.'s Q coefficient jumps to zero. Thus the 
two vertices in the third group have a disproportionately 
large effect on the value of Q. 

The solution to this problem is to give each vertex 
equal weight in the calculation, which is precisely what 
our measure r does. 
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